59 research outputs found

    A study on text-score disagreement in online reviews

    Get PDF
    In this paper, we focus on online reviews and employ artificial intelligence tools, taken from the cognitive computing field, to help understanding the relationships between the textual part of the review and the assigned numerical score. We move from the intuitions that 1) a set of textual reviews expressing different sentiments may feature the same score (and vice-versa); and 2) detecting and analyzing the mismatches between the review content and the actual score may benefit both service providers and consumers, by highlighting specific factors of satisfaction (and dissatisfaction) in texts. To prove the intuitions, we adopt sentiment analysis techniques and we concentrate on hotel reviews, to find polarity mismatches therein. In particular, we first train a text classifier with a set of annotated hotel reviews, taken from the Booking website. Then, we analyze a large dataset, with around 160k hotel reviews collected from Tripadvisor, with the aim of detecting a polarity mismatch, indicating if the textual content of the review is in line, or not, with the associated score. Using well established artificial intelligence techniques and analyzing in depth the reviews featuring a mismatch between the text polarity and the score, we find that -on a scale of five stars- those reviews ranked with middle scores include a mixture of positive and negative aspects. The approach proposed here, beside acting as a polarity detector, provides an effective selection of reviews -on an initial very large dataset- that may allow both consumers and providers to focus directly on the review subset featuring a text/score disagreement, which conveniently convey to the user a summary of positive and negative features of the review target.Comment: This is the accepted version of the paper. The final version will be published in the Journal of Cognitive Computation, available at Springer via http://dx.doi.org/10.1007/s12559-017-9496-

    Psip1/Ledgf p52 Binds Methylated Histone H3K36 and Splicing Factors and Contributes to the Regulation of Alternative Splicing

    Get PDF
    Increasing evidence suggests that chromatin modifications have important roles in modulating constitutive or alternative splicing. Here we demonstrate that the PWWP domain of the chromatin-associated protein Psip1/Ledgf can specifically recognize tri-methylated H3K36 and that, like this histone modification, the Psip1 short (p52) isoform is enriched at active genes. We show that the p52, but not the long (p75), isoform of Psip1 co-localizes and interacts with Srsf1 and other proteins involved in mRNA processing. The level of H3K36me3 associated Srsf1 is reduced in Psip1 mutant cells and alternative splicing of specific genes is affected. Moreover, we show altered Srsf1 distribution around the alternatively spliced exons of these genes in Psip1 null cells. We propose that Psip1/p52, through its binding to both chromatin and splicing factors, might act to modulate splicing

    Signaling probabilities in ambiguity: who reacts to vague news?

    Get PDF
    Ambiguity affects decisions of people who exhibit a distaste of and require a premium for dealing with it. Do ambiguity-neutral subjects completely disregard ambiguity and respond to any vague news? We couple decision-making in ambiguity with a preliminary information processing stage, where news is used to test prior beliefs and, possibly but not necessarily, update them. All decision-makers, including ambiguity-neutral, recognize and account for ambiguity at this stage; higher confidence makes ambiguity-neutral subjects less susceptible to vague news. In a two-color Ellsberg experiment with imprecise signals about the unknown probability of success they are less likely to respond to signals; the difference between them and non-neutral to ambiguity subjects vanishes for high precision signals. Less than 60% subjects choose the ambiguous urn, even for high communicated probabilities of success, suggesting many participants, especially ambiguity-neutral, discard vague news at the information processing stage. JEL: C90, D01, D81, as well as seminar participants at ETH-Zürich, University of Essex, University of Glasgow and University of Hamburg, and participants of iCare conference at HSE in Perm and JE on Ambiguity and Strategic Interactions at the University of Grenoble for helpful comments, suggestions and encouragement. All remaining errors are ours

    Pan-cancer analysis of whole genomes

    Get PDF
    Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale(1-3). Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, facilitated by international data sharing using compute clouds. On average, cancer genomes contained 4-5 driver mutations when combining coding and non-coding genomic elements; however, in around 5% of cases no drivers were identified, suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which many clustered structural variants arise in a single catastrophic event, is frequently an early event in tumour evolution; in acral melanoma, for example, these events precede most somatic point mutations and affect several cancer-associated genes simultaneously. Cancers with abnormal telomere maintenance often originate from tissues with low replicative activity and show several mechanisms of preventing telomere attrition to critical levels. Common and rare germline variants affect patterns of somatic mutation, including point mutations, structural variants and somatic retrotransposition. A collection of papers from the PCAWG Consortium describes non-coding mutations that drive cancer beyond those in the TERT promoter(4); identifies new signatures of mutational processes that cause base substitutions, small insertions and deletions and structural variation(5,6); analyses timings and patterns of tumour evolution(7); describes the diverse transcriptional consequences of somatic mutation on splicing, expression levels, fusion genes and promoter activity(8,9); and evaluates a range of more-specialized features of cancer genomes(8,10-18).Peer reviewe

    Comparing Player Responses To Choice-Based Interactive Narratives Using Facial Expression Analysis

    No full text
    Interactive storytelling balances the desire to create dynamic, engaging experiences around characters and situations with the practical considerations of the cost of producing content. We describe a method for assessing player experience by analyzing player facial expressions following key content events in The Wolf Among Us by Telltale Games. Two metrics, engagement and valence, are extracted for six participants who play the first episode of the game. An analysis of the variance and distribution of responses relative to emotionally charged content events and choices suggests that content is designed around events that serve to anchor player emotions while providing the freedom to respond through emotionally-motivated choice selections and content elicitors

    Classification of Customer Reviews based on Sentiment Analysis

    No full text
    In this paper we propose a system that performs the classification of customer reviews of hotels by means of a sentiment analysis. We elaborate on a process to extract a domainspecific lexicon of semantically relevant words based on a given corpus (Scharl et al., 2003; Pak & Paroubek, 2010). The resulting lexicon backs the sentiment analysis for generating a classification of the reviews. The evaluation of the classification on test data shows that the proposed system performs better compared to a predefined baseline: if a customer review is classified as good or bad the classification is correct with a probability of about 90%
    corecore